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Abstract 

We define the notion of linguistic structure on a small category, in order to provide a more formal 
description of ontology logs, also known as ologs, introduced in [20] by R. E. Kent and the second 
author. In particular, we construct a bicategory Eng, of English noun phrases and verb phrases, 
endorsed as functional by varying sets of authors. An olog is then defined as a lax functor to Eng. 
We then present a new notion of linguistic functor, which extends the notion of meaningful functors 
defined in [16]. Finally, we discuss the relationship between ologs and databases in this context. 
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1 Introduction 


The theory of ontology logs (ologs for short) was introduced by Robert Kent and the second 
author in their paper [20], as a framework for knowledge representation. Ologs are basi¬ 
cally mathematical categories that have been wrapped in natural-language English. They 
have been applied in several branches of science and engineering, [3, 5, 8, 9, 13, 19], as a 
tool for various kinds of formal modeling. 

Typically, a person who wishes to record and document some of her knowledge or ideas 
will do so in prose, e.g., a scientist publishes ideas in the form of research papers. Ologs 
offer the ability to express complex ideas using a special type of diagrams. Namely, the 
objects of study and the relationships between them can be represented as the objects and 
arrows in a category. The difference between an olog and a category is that an olog has 
additional structure: each object is labeled with a norm phrase and each arrow is labeled 
with a verb phrase, so that reading source-arrow-target yields an English sentence. These 
must satisfy certain rules, giving a set-theoretic semantics to the olog, which in turn allows 
it to serve a dual role as a database schema. There is a formula for composing sentences end- 
to-end into a new sentence, when following a path of arrows through an olog, and a pair 
of equivalent paths (also known as a commutative diagram) in the category is understood 
as a declared fact equating the two English sentences. 

The idea of regarding English sentences as mappings is not new. One interesting approach 
on this matter is given by the notion of conceptual metaphor [22] which links one idea (a 
source domain) to another (a target domain) to better understanding something. More for¬ 
mal approaches (compared to the previous one) to language, knowledge and information 
modeling have been developed by other authors, such as D. Kartsaklis, M. Sadrzadeh, S. 
Pulman and B. Coecke. In [12], for instance, they make use of notions from category the¬ 
ory, such as compact closed categories and strongly monoidal functors, to studying meaning in 
natural language. 

The primary goal of this paper is to present a reformulation of the previous concept of olog, 
which leads to a categorical and linguistic description of mappings between ologs. In our 
reformulation, any type, aspect, or fact (in the sense of [20]) in an olog must be endorsed 
by a set of people, who understand its meaning, i.e., the way it indicates a set or a function. 
This cultural understanding of the English language is captured by a certain bicategory Eng. 
The above idea that ologs are categories wrapped in English will be formalized by saying 
that ologs are categories mapping to Eng. Our notion of mapping between ologs falls out 
of that structure. The result is that a map between ologs is not just a functor between their 
underlying categories; it is a functor that respects the linguistic description on each node 
and arrow of the ologs. For example, there is an obvious functor F (namely the identity) 
between the underlying categories of the following ologs: 
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a man 

a woman 

is 

i 

has as weight (in kilograms) 

1 


an object 

a number between 20 and 500 




But would this functor F mean anything? We will rephrase this question in § 3.2 as follows: 
"does there exist an author who is willing to endorse a linguistic structure on FI" In this 
paper we explain the difference between a (linguistic) map of ologs and a mere functor be¬ 
tween their underlying categories. We will address this particular case (1) in Example 3.2.6. 

To some extent, this issue is considered in [16], where the authors introduce the concept of 
meaningful functor. One limitation of their approach is that it depends on a strong assump¬ 
tion, namely that every olog C is equipped with a functor I : C —*■ Set, and that this functor 
somehow controls the meaning of the olog. Such a set-valued functor is called an instan¬ 
tiation, a term coming from database theory [17]. The idea is that I represents a kind of 
database of examples, or instances, for the various types, aspects, and facts in the olog. For 
example, if an author is writing an olog C describing a familiar real-world situation, then 
for a type c (e.g., c = cat) the set 1(c) represents all the examples of c (all cats) known by 
the author. Somewhat strangely, however, there is no requirement in [16] that the database 
functor I should in any way correspond to the linguistic structure on the olog. A similar 
issue exists for functors between ologs in [16]. 

In this paper, we remedy these issues. First, we allow ologs to exist without being instanti¬ 
ated; that is we disentangle ologs and their instantiations. This way, the set of documented 
examples can evolve over time, without changing the olog to which they refer. On the 
other hand, we add a constraint to instantiations: for a set-valued functor to count as an in¬ 
stantiation of an olog C, it must conform to the linguistic structure, the labelings, on C. The 
same goes for mappings between ologs: in order for a functor to count as a mapping be¬ 
tween ologs, it must conform to the linguistic structures involved. We also take more care 
to explain the relationship between an olog and its authors. We introduce the concept of 
endorsement: an author can endorse that a certain concept or relationship between concepts 
makes sense, that a certain fact is true, etc. 

Here is an expert-level view of this paper. There is a bicategory Eng that denotes the English 
language as it divides into norm phrases, which indicate sets, and verb phrases, which 
indicate functions, where all of this "indicating" is decided solely by speakers. Given that 
such a bicategory Eng exists, an olog is just a small category C and a lax functor L : C —» Eng, 
called a linguistic structure on C. Allowing the base category to vary, we get a fibration 
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Olog —> Cat, where Olog denotes the category of ologs and Cat is the category of small 
categories. Instantiated ologs are defined similarly: there is a bicategory InstEng £ Eng x Set 
in which the noun phrase associated to each object is exemplified by a set, and each verb 
phrase associated to a morphism is exemplified by a function. An instantiated olog is a lax 
functor C —* InstEng, and we again have a fibration InstOlog —* Cat. All of this will be 
explained in the main sections of the paper. 

This paper is organized as follows: In § 2 we introduce the bicategory Eng of English lan¬ 
guage. We present the notion of author endorsement for noun phrases, verb phrases and 
equivalence between sentences, along with a list of linguistic guidelines needed to design 
any olog within Eng. § 3 introduces the category Olog of ologs, as well as the category 
InstOlog of instantiated ologs, by defining mappings between ologs and between instanti¬ 
ated ologs. To do so, we introduce the notion of linguistic functors and instantiated functors, 
as mentioned above. The latter of these is an adaptation of the "meaningful functor" notion 
defined in [16]. 

Most of the assertions on the bicategory Eng have a linguistic element and so are not purely 
mathematical. However, with the help of several linguistic postulates (mainly found in 
§ 2.2), fairly formal proofs are possible. Once Eng is given, the rest is straightforward 
category theory, thus giving to the theory of ologs a more solid theoretical basis. As the 
paper is mainly written for a non-mathematical audience, all proofs are given as prose 
arguments, merged with the text. 


Background and Notation 

We will assume the reader is familiar with some basic concepts from category theory, such 
as opposite categories, isomorphisms, functors, and natural transformations. Readers without 
category-theoretic background may still benefit from reading the less categorical defini¬ 
tions and results, skimming the category theory, and trying to digest the examples. The 
books [1, 18] are good sources for category theory, with many illustrations. No previous 
knowledge on bicategories, lax functors and lax transformations is needed; everything we 
say about these topics will be spelled out in concrete terms, but interested readers can check 
the book [14, Chapter 1] as an excellent source for a brief review on this matter. 

The word "category" will always mean a small category unless otherwise stated, i.e., the 
collections of objects and morphisms (also called arrows) are sets. Given a category C, we 
will denote the sets of objects and morphisms of C by Ob(C) and Mor(C), respectively. 
For every two objects c,c' e Ob(C), we will denote the set of morphisms from c to c by 
Homc(c, c'). The composition of two morphism /): c -*■ c' and f 2 ■ c' -* c" will be denoted 
using the semicolon symbol e.g., by /i; / 2 : c -> c". Our reason to use this notation is 
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to make the composition on morphisms in C parallel to the concatenation of sentences (see 
Definition 2.1.6) in any olog set on C. We denote the (large) category of sets, functions, and 
function composition by Set. 


2 English language as a bicategory 


One of the goals of this paper is to formalize the definition of olog. As we mentioned in the 
introduction, the idea is to define an olog as a category C together with a lax functor from 
C to a particular bicategory Eng, which encodes how English speakers understand their 
language. Details about bicategories, lax functors, and lax transformations will be given in 
the context of ologs, through the constructions of Eng, linguistic structures, and linguistic 
functors, respectively. 

We begin this section presenting the notions of norm phrase, verb phrase, and equivalence 
of sentences in § 2.1. These, along with the notion of author endorsement are going to 
be principle components defining the bicategory Eng, presented in § 2.3. As ologs are 
straddling the world of categories and linguistics, we must explain what it means for the 
labels on objects and morphisms to correspond to valid sets and functions. This concept 
of endorsement should satisfy certain guidelines, which we will present as (linguistic) pos¬ 
tulates. After assuming these postulates, we will be able in § 2.3 to regard the English 
language as a bicategory Eng, whose collections of 0-cells, 1-cells, and 2-cells formalize the 
concepts of types, aspects, and facts (respectively) that define an olog in the spirit of [20]. 
We formalize ologs as lax functors from small categories to Eng (see Definition 3.1.1). 

In § 2.4 we will equip each type in Eng with a set of instances, i.e., an author-specified set 
of examples corresponding to the linguistic label. Doing so, we obtain another bicategory, 
denoted InstEng, and following the same plan as above, we define instantiated ologs later 
in Definition 3.3.3. 


2.1 Nouns, verbs, and functional sentences 

When we speak of a norm phrase, we refer to what is more formally called a singular indef¬ 
inite noun phrase, such as "a person". If N is a symbol representing this noun phrase, we 
write ((A/’))="a person"; we call ((A f)) the reading of M (and say that A f is read ((A [))). We 
use similar notation for verb phrases. For us, a verb phrase V is that which connects one 
(singular indefinite) noun phrase, often called the subject, to another, often called the object. 
If ((V))="is" and ((A/ r, ))="a mammal", then we denote the concatenated string, "a person is 
a mammal" by ((A/’))((V))((A/ v )) or simply by ((AfVAf')). 
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Definition 2.1.1 A noun phrase is formed by the indefinite article a/an followed by an 
English noun which refers to a set-like concept, i.e., a concept for which an author can 
record a set of examples. ♦ 

Example 2.1.2 The noun phrase [a shirt] 1 refers to the set of shirts, i.e., things that can be 
called "a shirt". Similarly, the noun phrase [a prime number less than 10] refers to the set 
{ 2 , 3 , 5 , 7 }. ♦ 

In order to clarify the semantics, we impose a certain rule on our verb phrases: they must 
refer to functions, in the sense of set theory (see [20]). 

Definition 2.1.3 Given two noun phrases A/] and A/" 2 and a verb phrase V, one says that V 
functionally connects Ai to A/" 2 if the concatenation ((A/] VAT)) is an English sentence that 
refers to a mathematical function. In this case one says that E = (A/i, V, A/" 2 ) is a functional 
sentence. ♦ 

For example, given two noun phrases [a person] and [a woman], the arrow 


a person 


has as mother ,- 

-> a woman 


can be read as a sentence, "a person has as mother a woman". This sentence expresses 
that for anything that could be called "a person" there is something it "has as mother" 
that can itself be called "a woman". This seems true, but more important for us, it seems 
to represent a function: each person has only one mother. A reader who understands 
these concepts probably has an example of a person (namely himself) and the woman that 
corresponds to the mother he has. Thus the arrow text is a verb phrase that functionally 
connects [a person] and [a mother] in the sense of Definition 2.1.3. 

We can contrast the above situation with the following: 


a woman 


has as dog' 
-> 


a canine 


Because not every woman has a dog, and some women have two dogs, the arrow text is a 
verb phrase that does not correspond to a function; i.e., it does not functionally connect its 
source and target noun phrases. 


We next consider the notion of equivalent verb phrases, by which we mean verb phrases 


J As in [20, §2.1], we write [a shirt] instead of 


a shirt 


because some typographical problems emerge when writ¬ 


ing a text-box in a line of text. Note that the text-box seems out of place in this paragraph, and so many in-line 
text-boxes are troublesome for the aesthetic of this paper. 
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that indicate the same functional relationship. For example, if I say "An integer has as 
successor an integer", and you say "An integer yields, by adding 1, an integer", we are 
saying the same thing using different verbs. 

Definition 2.1.4 Two sentences E = (A/i, V, A4) and E' = (A/i, V', A/ 2 ) are said to be equiv¬ 
alent, denoted E - E', if they refer to the same functional relationship. If E - E', then their 
verb phrases are equivalent as well, and we denote V - V'■ 


Graphically, we will draw a checkmark ■/ between the two sentences E and E' if they—and 
hence their verb phrases V, V '—are equivalent, as shown in the following figure. 


V 



V' 


Figure 1: Equivalence between V and V'. 

As in [18, §2.3.3.4], an equivalence E ^ E' can be given the following English-language 
interpretation (where x, y \, and y-> are just symbols, to be copied verbatim): 

For any ((A/"i)) x, we know that x ((VA/ 2 )), that we call y\, and we know that x 
((V'A/ 2 )), that we call y 2 ; and the fact is, y \ and y 2 are the same for any x. 

Figure 2: Flow equivalences are interpreted as English. 


♦ 

Example 2.1.5 We explain how to use the syntax formula in Figure 2 to show that the 
following two sentences are equivalent: 


has as successor 


an integer 




an integer 


yields, by adding 1, 


The equivalence E ^ E' is read as follows: 

For any integer x, we know that x has as successor an integer, that we call y\, 
and we know that x yields by adding 1 an integer, that we call yy and the fact 
is, y\ and y 2 are the same for any x. 

We can agree that the previous statement is valid, since yi = x + 1 = y 2 , by the definition of 
successor, and thus E ^ E'. ♦ 
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Definition 2.1.6 Let £1 = (A/). Vi. A/" 2 ) and £ 2 = (A f 2 , V 2 , A/3) be two sentences. If A 2 = M' 2 , 
we say that £1 and E 2 are concatenatable. Let Vi; V 2 represent the verb phrase 

«Vi;V 2 )) := « V1A/2 )) "which" « V 2 )) - 

We define the concatenation of Ei and E 2 , denoted Ei; E 2 , to be the sentence 


Si; E 2 := (A/i, Vi; V 2 ,A/ 3 ). 


♦ 


Example 2.1.7 Consider the diagram below: 


an amino acid 


has 


_ A/*2 _ 

an amine group 


includes 


_ A/3 _ 

a nitrogen atom 


If ((Vi)) = "has" and ((V 2 )) = "includes", then the sentences Ei = (A/i, Vi, A4) and E 2 = 
(A/ 2 , V 2 , A/ 3 ) are concatenatable. Their concatenation is read 

((£ 1 ; E 2 )) = "an amino acid has an amine group, which includes a nitrogen atom". (2) 

♦ 

Example 2.1.8 In this example, we combine the concepts of equivalence and concatena¬ 
tion of sentences. The following diagram represents an equivalence between sentences, as 
indicated by the checkmark symbol ■/: 



First, note that the concatenation of the top and right arrows yields the sentence "a person 
lives at an address, which includes a city", and the diagonal arrow is read "a person lives 
in a city". According to Figure 2, the equivalence between these two sentences is read as 
the assertion 

For any person x, we know that x lives at an address which includes a city y\, 
and we know that x lives in a city y ->; and the fact is, y\ and y 2 are the same for 
any x. 

Contrast that fact to the following diagram: 
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a person 


lives in 


lives at 


a city 


has its most 
affluent residence at 


(3) 


an address 


includes 


a house number 


which does not represent a valid equivalence (note the absence of the checkmark symbol •/). 
In this case, the assertion indicating the equivalence between the two involved sentences 
would be read 


For any person x, we know that x lives in a city, which has its most affluent 
residence at a house number y\, and we know that x lives at an address, which 
includes a house number t/ 2 l and the fact is, yi and y -2 are the same for any x. 

This assertion is quite dubious, because there seem to be many city-dwellers whose house 
numbers are different from that of the most affluent resident in their city Hence, we as 
authors cannot endorse (3) as an equivalence. ♦ 

Definition 2.1.9 We say that a symbol £ is a linguistic expression if it represents a noun 
phrase, a verb phrase, a sentence, or an equivalence between sentences. ♦ 


2.2 Some postulates about endorsement 

In this section, we introduce the notion of author endorsement. This is the cultural aspect of 
the English (or any) language: noun phrases and verb phrases do not inherently represent 
sets and functions. The relationship between words and their meaning is, to some extent, 
author dependent. 

After defining endorsement, we provide some postulates about it. These postulates are 
what allows us to represent the English language as a bicategory Eng in § 2.3, which is the 
main ingredient in our formal definition of ologs. 

The following definition cannot, by its nature, be made mathematically rigorous. Indeed, 
it asks the reader to grant that there are things called people, and that people can evaluate 
whether verbs refer to functions, etc. 

Definition 2.2.1 Let s be a person who understands Definitions 2.1.1, 2.1.3, and 2.1.4. 
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(a) If Af is a noun phrase, we will say that s endorses A f, denoted s t= AT, if ((A f)) refers 
to a distinction made and recognizable by s, i.e., if s agrees that ./Visa set-like noun 
phrase, in the sense of Definition 2.1.1, and can imagine example elements of it. 

(b) Suppose that £ = (A/i. V. Ay) is a sentence and that .s t= Af\ and s 1= A/ 2 . We say that s 
endorses V, denoted s 1 = V, if s is willing to declare that V is functional, in the sense of 
Definition 2.1.3, and understands how examples of A/) correspond via V to examples 
of A /2 • In this case we also write s l= £. 

(c) If £ = (A/i, V,A/ 2 ) and £' = (A/i, V.Af 2 ) are two sentences such that s t= £ and s 1 = £', 
then we say that s endorses £ ^ £', denoted s l= [£ ^ £'], if s is willing to declare that 
£ ^ £' is an equivalence in the sense of Definition 2.1.4 and that the correspondences 
V and V' act identically on examples of A/i, as in (b). 

By an author set S we mean a set of people in which each member is able to decide whether 
or not a linguistic expression (see Definition 2.1.9) is valid in the sense of (a), (b), (c) above. 
If S is an author set and £ is a linguistic expression, we say that S endorses C, denoted 
S l= £, if s l= C for every s e S. ♦ 

Example 2.2.2 A reader may not be able to endorse the sentences £1 and £2 of Exam¬ 
ple 2.1.7 as functional, whereas others would endorse them. That is, some may not know 
whether every amino acid always contains exactly one amine group or whether every 
amine group contains exactly one nitrogen atom. But given that an author s endorses 
them (and we should mention that £1 and £2 are broadly endorsed). Postulate 2.2.3 below 
says that s must also endorse their concatenation £p £ 2 . ♦ 

Recall that a postulate is an idea suggested or assumed as true as the basis for reasoning, 
discussion, or belief. We make a few linguistic postulates throughout this section and § 2.4, 
starting with the next one. 

Linguistic postulate 2.2.3 Let £1 and £2 be two concatenatable sentences as in Defini¬ 
tion 2.1.6. If s endorses both sentences s 1 = £1 and s 1 = £ 2 , then we will assume that s 
endorses their concatenation, s 1= £ 1 ; £ 2 . ♦ 

The following postulate says that every olog author must endorse sentences like "a bottle 
is of course a bottle" as functional. We will later postulate (in 2.4.3) that such sentences 
correspond to identity functions on sets, in what we call instantiated English. 

Linguistic postulate 2.2.4 We will assume that there is a unique verb phrase e, read ((e)) = 
"is of course", such that if Af is a noun phrase and s is an author with s 1= N, then s 1= 
(AT, e,AT ). We call e the unit verb phrase. ♦ 

The following postulate says that the unit verb phrase introduced above is unital with 
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respect to the string concatenation defined in Definition 2.1.6. 

Linguistic postulate 2.2.5 Let e be the unit verb phrase as in Postulate 2.2.4. For every 
sentence E = (A/i, V, A/ 2 ) and any author s, if s t= E then 

s l= [E; (A/ 2 , e, A/ 2 ) - E] and s t= [(A/i, e, A/i); E - E]. 4 

Remark 2.2.6 Associativity of concatenation is always endorsed by any author endorsing 
the sentences involved, but this does not require a postulate. It follows from the associativ¬ 
ity of string concatenation. ♦ 


2.3 The bicategory Eng of English expressions 

In this section we explain how to combine the notions of noun phrase, verb phrase, equiv¬ 
alence, and endorsement to construct a bicategory based on the English language. 

Recall from Definition 2.2.1 the notion of endorsement (denoted by the symbol t=) for noun 
phrases, verb phrases, and equivalence between sentences. Recall also that by an author, 
we mean a person who understands Definitions 2.1.1, 2.1.3, and 2.1.4; an author set is a set 
of authors. 

Definition 2.3.1 

(a) A type N = ( J\f,T) consists of a noun phrase A f (see Definition 2.1.1) and an author 
set T such that T l= A f. The author set may be denoted Auth( N) := T. 

(b) Given two types N\ and N->, an aspect from N\ to Ay, denoted V : Ni -+ Ay, consists 
of a pair V = (V, A), where V is a verb phrase functionally connecting A/i to A /2 in the 
sense of Definition 2.1.3, where A is an author set such that A £ Au th(Ay) n Auth(Ay), 
and where A t= V . The author set may be denoted A u th (V ) := A. 

(c) Given two types Ay and N 2 , and two aspects Vi,V 2 : Ay -> N 2 , a fact from V\ to 

V 2 , denoted ly => V 2 , is given by an author set F £ Auth(ly) n Auth(V 2 ) and an 
endorsement F 1 = [Vi - V 2 ], in the sense of Definition 2.1.4. We allow an abuse of 
notation and write F :\i => V 2. 2 ♦ 

We will soon show (see Proposition 2.3.5) that (a), (b), and (c) above define the collection 
of 0-cells, 1-cells and 2-cells of a bicategory, which we denote Eng. Roughly speaking, we 
need to define two operations on 1-cells and 2-cells, usually called horizontal and vertical 
compositions, satisfying certain properties and coherence conditions. 

2 Notice that we use only to denote equivalence between sentences, and the double arrow "=>" to denote 
2-cells in Eng, which are formed by an equivalence between sentences and an endorsing author set. 
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Most of the work is dealing with author sets, a consideration with which many readers 
need not be overly concerned. The main idea, which one should really know in order to 
proceed, is that ordinary ("horizontal") composition in Eng involves the concatenation of 
sentences, from Definition 2.1.6. On a first reading, one may skip to § 2.4. On a more careful 
reading, on the other hand, a good way to understand this point in a more formal way, is 
to study first the collection of all noun phrases and verb phrases endorsed by a single 
author s, which will form a category Eng s . In that particular context, author endorsement 
becomes straightforward, allowing us to treat the composition of morphisms in Eng s as 
a mere concatenation of sentences. We provide the details about this in the next section. 
Once we have defined Eng s , we will focus on proving Proposition 2.3.5. 


The category of English expressions endorsed by a single author 

Proposition 2.3.2 Let s be an author. There is a category Eng s with the following collection 
of objects and morphisms: 

(a) Ob (Eng,) is the set of types in Eng which have the form A = (TV, s), i.e. r s l= AT. 

(b) Mor(Eng s ) is the set of aspects in Eng which have the form V = (V, s), i.e., s t= V. ♦ 

Proposition 2.3.2 is basically a consequence of some of the linguistic postulates in § 2.2. 
First, given two composable morphisms V\ : N± -* Ay and V 2 : N 2 -*■ A 3 , the composition of 
V\ and V 2 in Eng s is given by the concatenated verb phrase Vi; V 2 , which is endorsed by s 
by Postulate 2.2.3, and which is associative by Remark 2.2.6. Finally, for every object A in 
Eng s , one has that s endorses the sentence (A'", e, A f) by Postulate 2.2.4, because s\=J\f. The 
morphisms of the form (e, s): A -»• A, which we denote by id.v, are the identity morphisms 
in Eng, by Postulate 2.2.5. 

One interesting property about Eng s is that the equivalence between sentences is a congru¬ 
ence relation on Eng s , meaning it satisfies the conditions of Proposition 2.3.3. For every 
A, A' e Ob(Eng s ) and every V, V' e HomE ngs (AT, A'), we write 

V ^’ N ' V' 

if s l= [V - V' ]. The following result provides the necessary theoretical framework to define 
the horizontal composition of aspects and the vertical composition of facts in the bicategory 
structure of Eng. 

Proposition 2.3.3 For every author s, the relation satisfies the following properties: 

• For every N,N' e Ob(Eng s ), the relation is an equivalence relation on the 

homset HomE ngs (A. N'). That is, for any aspects V. VV" e HomE ngs (A, A') 
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(a) ~^’ N is reflexive: V ~^ ,N V . 

(b) ~^’ N is symmetric: V ~^’ N V' if, and only if, V' ~^’ N V'. 

(c) -™’ N is transitive: if V ~^’ N V' and V' ~^’ N V ", then V -™’ N V". 

• The collection -. s of equivalence relations defines a congruence on Eng s : 

(d) For every N, N', TV", if Vi ^’ N ' V[ and V 2 -f’ N " V 2 ', then V i; V 2 V{\ V 2 '. 


Vi 


Af 




vi 



♦ 


Remark 2.3.4 Although Propositions 2.3.2 and 2.3.3 are stated in terms of a single author 
s, they are also valid for any fixed author set S. ♦ 


Proof that ^ s is a congruence relation on Eng s [technical] 

Proposition 2.3.3 is a consequence of the linguistic postulates of § 2.2. For (a), if V : N -*• N 1 
is an aspect such that s endorses V as functional, then the syntax formula in Figure 2 is 
true: 


For any ((A f)) x, we know that x {{VAf)), that we call y\, and we know that x 
{{VAf)), that we call y 2 ; and the fact is, y\ and y 2 are the same for any x. 

Note that the functionality of V implies that y\ - y 2 . Condition (b), on the other hand, 
follows directly by applying the same syntax formula, since "=" is a symmetric relation. 
Condition (c) follows similarly. 

Condition (d) is more delicate to check. Suppose we are given aspects V \, V{: N -*• N 1 and 
V 2 , V 2 ': N' -> N" such that s t= [V\ ^ V{\ and s l= [V 2 - V 2 ]. Let x be any ((Af)). Then then 
following statement is true: 

We know that x((ViAf')), that we call y\, and we know that x((V{Af')), that we 
call y 2 ; and the fact is, y i and y 2 are the same. 

Set y = yi = y 2 . Since s l= [V 2 - V 2 '\, we have that the following statement is true for y: 

We know that y((V 2 N")), that we call Z\, and we know that y{(V 2 N")), that we 
call z 2 ; and the fact is, z\ and z 2 are the same. 
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Now, consider the following statement 

We know that for any ((Af)) x, we know that x((V\Af')), which (( V 2 A f")) r that we 
call w\, and we know that x((V[Af'}}, which ((V 2 Af")), that we call w 2 ; and the 
fact is, w\ and w 2 are the same. 

Since s 1 = Vi; V 2 and s 1 = Vf ; V 2 by Postulate 2.2.3, we have that w\ = zi and w 2 - z 2 - On the 
other hand, we know that z\ = z 2 . It follows that the previous statement is true, and hence 
s l= [V i; V 2 - ; V^] - 


Structure of Eng as a bicategory [technical] 


Denote by Ob(Eng) the collection of types in Eng. In order to see that (a), (b) and (c) from 
Definition 2.3.1 define a bicategory Eng, the first thing we need to know is that for each 
pair of types Ni, N 2 e Ob(Eng), there is a category Eng(Aq. N 2 ), whose objects are given by 
aspects V: Ah N 2 (i.e ., our 1-cells), and whose morphisms between aspects V], V 2 '• N\ -* 
A ’ 2 (i.e., our 2-cells) are given by the facts F: Vj V 2 . 


Vi 



V2 


Figure 3: 2-cells in Eng. 

The composition of facts in Eng(iV|. A ? 2 ) is called vertical composition: given two facts 
Fi : Vi => V 2 and F 2 ■ V 2 => V 3 , the composition is given by intersecting author sets: 


F 2 ° F\ := Fi n F2 l= [Vi - V 3 ]. (4) 

This composition defines a fact by Proposition 2.3.3 (c), and is associative since the inter¬ 
section of sets is. Now, for every aspect V : N\ Ay there is a unique identity fact at V, 
given by the endorsement Auth(D) 1 = [(A/i, V, A/" 2 ) - (A'’i. V. ) ] (see Proposition 2.3.3 (a)). 
It follows by definition that these facts, that we denote idy, are unital with respect to the 
previous composition. Therefore, Eng(A r ]. A 2 ) defines a category. 

To define horizontal composition we will need to supply, for every Ni,N 2 ,N 3 e Ob(Eng), 
a functor 

Eng(lVi, 1V 2 ) x Eng(A 2 ,IV 3 ) — Eng (N lt N 3 ). 
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We will denote it by 


( V, V') i-s- V; V' on aspects, 
(F, F') i-*- F • F' on facts. 


Horizontal composition of aspects V = (V, Au th( V )) works by intersecting author sets and 
concatenating verb phrases, as in Definition 2.1.6, i.e., 

V ; V' = ((V; V'), Auth(V) n Auth( V)) (5) 

Let Auth(V; V') := Auth(V) nAuth(W). 

Vertical composition of facts is given as follows. If we are given F: V\ => V 2 and F'\ V[ => 
Id/, define a fact F • F r : Vf; V[ => 14; V[ to be the endorsed equivalence 

FnF' \= [(Vi; V 2 ) - (V/; V 2 )] ( 6 ) 

(see Proposition 2.3.3 (d)). Note that this composition is associative at the level of aspects 
(see Remark 2.2.6). Moreover, for each type N e Ob(Eng), there is an identity aspect id,y 
in Eng( N, N ) given by the verb phrase e, as in Postulate 2.2.4, and endorsed by Auth( N ). 
Note that id a- is unital with respect to the horizontal composition at the level of aspects. At 
the level of facts, we can check that for each aspect V: Ni -*• iV 2 , there are isomorphisms 

V;idjv 2 => V and id^; V => V, 

since the equivalences [V;e] ^ V and [e; V] ^ V are endorsed by Auth(V) n Auth(A 2 ) = 
Auth(V) and Auth( V) n Auth(Ai) = Auth(V) by Postulate 2.2.5. 

Finally, according to [14, Definitions 1.2.5 & 1.5.1], it is only remains to verify that the 
previous associativity and unital isomorphisms satisfy the pentagon and triangle axioms 
(coherence conditions) to conclude that Eng is a bicategory, but this is a straightforward con¬ 
sequence of the previous constructions. Summarizing, we have proved Proposition 2.3.5. 

Proposition 2.3.5 The collection Eng of all types, aspects, and facts as in Definition 2.3.1, 
equipped with the vertical and horizontal compositions given by (4), (5), and ( 6 ), forms a 
bicategory. 4 

Remark 2.3.6 Let Vf => V 2 be a 2-cell in Eng, i.e., a fact. The direction "=>" is determined 
by the way the equivalence Vi - Vj is read according to Figure 2, i.e., we read the statement 
involving "((Vi))" first. Note by the symmetry of the relation - (see Proposition 2.3.3), that 
if V\ => I 2 is a fact with author set F, then so is V-i => V\. ♦ 
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2.4 The bicategory InstEng of instantiated English expressions 


In this section we add to Eng some additional structure by exemplifying each linguistic 
expression in it. That is, we bundle each type N with a set of examples, or as we call 
them, tokens, ’ where the tokens must match the norm phrase ((A f)). Thus, we obtain a new 
bicategory InstEng of instantiated English expressions, as shown in Proposition 2.4.5. Its 
0-cells, 1-cells and 2-cells will be described in Definition 2.4.4. The bicategory InstEng will 
provide the necessary theoretical framework to present the concept of instantiated olog. 

Just as for linguistic expressions, we need some postulates about author endorsement to 
govern the behavior of tokens with respect to noun phrases and verb phrases. For exam¬ 
ple, if you understand the noun phrase [a US president], then you probably endorse that 
Abraham Lincoln is an example, or token, of the corresponding set. 

Recall the notion of endorsement for noun phrases and functional verb phrases from Defi¬ 
nition 2 . 2 . 1 . 

Definition 2.4.1 

(a) Let M be a noun phrase. One says that a; is a token of A/" if x is an example to which 
the noun phrase ((A f)) applies, i.e., if the following sentence is true: 

AA(x):= a; "is" ((AT))- (7) 

If s is an endorsing author s 1 = A f and a; is a token of A f according to s, we say that s 
endorses a; as a token of AO this will be denoted by s \= (x : A f). 

(b) Let (A/i, V, A4) be a sentence, and let x and y be tokens of A/i and AO, respectively. 
One says that x corresponds to y via V if the following sentence, denoted V(a;, y), is 
true: 

V(x,y) := x "is" ((A/i)) "which" ((VAO)), "namely" y. (8) 

If s is endorsing author s 1 = (A/i, V, AO), s 1 = (a;: A/i), s 1 = (y ■ A/0), and if a: corresponds 
to y via V according to s, then we say that s endorses the correspondence V(x,y) 
between a’ and y; this will be denoted by s l= V(a;, y). 

If S is an author set, we say that S endorses either a token S l= (x : A f) or a correspondence 
between tokens S \= V(x, y), if every member of s endorses it, i.e., s t= (a; : N) or s t= V(x, y) 
for every s e S. ♦ 

3 The motivation behind the term "token" comes from the usual usage in philosophy, e.g., [21], Tokens are real- 
world examples and instances of abstract types. In this paper the word "example" has a wider connotation, 
while the term "instance" will be reserved for Definition 3.3.3. 
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Example 2.4.2 Consider the following sentence, which we endorse as functional: 

can, using the usual chronological 
ordering of presidents, be assigned 


We endorse that Abraham Lincoln is a token of P, that 16 is a token of I, and that they 
correspond via the verb phrase labeling the arrow. That is, we agree that: "Abraham Lin¬ 
coln is a US president which can, using the usual chronological ordering of presidents, be 
assigned an integer, namely 16." ♦ 

We now postulate that the correspondence ( 8 ) is functional, and that the unit verb phrase 
and composition of verb phrases act like the identity function and function composition 
for instances, respectively. 

Linguistic postulate 2.4.3 Let s be a person who understands Definition 2.4.1. 

(a) Suppose given an endorsement s i= (A/j. V. Ay). Then for every token x of Ni such 
that s 1 = (x : Ni), there exists a unique token y of N 2 such that s 1 = V(x, y). 

(b) Suppose given endorsements for concatenatable sentences, s l= (Afi,Vi,Af 2 ) and s l= 
(A/* 2 ,V 2 ,A/ 3 ). Note that s 1 = (A/),Vf; V 2 ,Af?,) by Postulate 2.2.3. Suppose also that s 
endorses tokens s l= (x : A/i), (y : A/ 2 ), (z : A/ 3 ) and correspondences s l= Vi(x,y ) and 
s l= V 2 (y, z). Then we will assume that s \= (Vi; V 2 )(x, z). 

(c) Suppose given an endorsement s 1 = (x ■ A f) for some noun phrase A f. Then we will 

assume that s l= e(x, x), with e as the unit verb phrase defined in Postulate 2.2.4. ♦ 

Recall the definitions of types, aspects, and facts from Definition 2.3.1, and the notion of 
tokens and their correspondences from Definition 2.4.1. 

Definition 2.4.4 

(a) An instantiated type is a pair N = ( N , T), where A is a type with noun phrase A f and 
T is a set such that for every x e T, we have an endorsement 

Auth( N) \= (x : A f) 

that x is a token of A f. We may write T( N) := T and refer to it the token set of N. 

(b) Suppose given two instantiated types Ni and N 2 . An instantiated aspect from N 1 
to N 2 , denoted V\ N 1 -*• N 2 , consists of a pair ( V, /), where V: AT -> N 2 is an aspect, 
and / is a function /: T(Ai) -*■ ¥(^2), such that for every token x s T(ATi), there is 
an endorsement 

Auth(U) 1 = V(x,/( x)), 


1 

an integer 


_ p _ 

a US president 
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where V is the verb phrase of V. We may write T'( V ) := / and refer to it as the token 
function of V. 

(c) Suppose given two instantiated aspects Vi, V 2 '■ A 1 -*■ N 2 with equal token functions, 
T(V"i) = T(D 2 ). An instantiated fact from V\ to Vi, denoted F: V\ => V 2 , consists 
of a fact F = F, where 

F 1 = [Vi — V 2 ] 

is a fact in Eng. ♦ 

As in § 2.3, one can prove that the collections of instantiated types, aspects and facts in 
Definition 2.4.4 form a bicategory InstEng £ Eng x Set. We do not give many details on 
this, since the proof proceeds as it did with Eng in the previous section. However, we will 
specify how vertical and horizontal compositions are defined. On a first reading, one may 
skip to Proposition 2.4.5. 


Structure of the bicategory InstEng [technical] 


Denote by Ob(lnstEng) the collection of instantiated types, as in Definition 2.4.4. For each 
pair Ai, A 2 e Ob(lnstEng), there is a category lnstEng(Ai,A 2 ) whose objects (i.e., 1-cells in 
InstEng) are given by instantiated aspects V: Ai -*■ N 2 , and whose morphisms (i.e., 2-cells 
in InstEng) V\ => V 2 are given by: 


^° m |nstEng(Ari,Af 2 )^ 1, ^ /2 ^ ' 


H ° m En g (v 1 ,iV 2 )( t/l ’ y2 ) ifT (^l)= T (^), 

0 otherwise. 


By the previous formula it follows that the vertical composition in InstEng is just the verti¬ 
cal composition in Eng, and identities are also straightforward. 


For every triple Ai, A 2 , A3 € Ob(lnstEng), there is a horizontal composition functor 

lnstEng(Ai, N 2 ) x InstEng (N 2 ,N 3 ) -* lnstEng( Ni, N 3 ), 

denoted V \\V 2 for every Hi e Ob(lnstEng(Ai, N 2 )) and V 2 e Ob(lnstEng(A 2 , A 3 )), acting 
on instantiated aspects according to the intersection of sets, concatenation of verb phrases, 
and composition of functions. 

At the level of instantiated facts, if we are given F: V\ => V 2 and F : V t => V 2 , then the 
instantiated fact 

f.f'-.v 

is given by the endorsed equivalence F n F t= [ V \: V t V 2 : V 2 j. Finally, for each instan- 
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tiated type N, there is an identity instantiated aspect id— given by unit the noun phrase e 
as in Postulate 2.2.4, endorsed by Auth(lV), and by the identity function id T ,^: T (N) -*■ 
T (N), which is unital with respect to the horizontal composition of instantiated aspects. 
Note that the previous operations are well defined by Postulate 2.4.3. 

Proposition 2.4.5 The collection of all types, aspects, and facts as in Definition 2.4.4 forms 
a bicategory, denoted InstEng. ♦ 

As was true with Eng, if we fix a single author s (resp. an author set S), there is a subcate¬ 
gory InstEng,, £ InstEng of instantiated linguistic expressions endorsed by s (resp. S ). There 
is no need to consider 2-cells in InstEng,., and composition is given simply by concatenating 
verb phrases and composing functions. 

The following result is just a matter of checking the conditions in [14, Definition 1.5.8]. 
Proposition 2.4.6 
(a) The mappings 


N TV, for every instantiated type N, and 
V V, for every instantiated aspect V. 

define an strict functor between bicategories U : InstEng —> Eng. 

(b) The mappings 

N >-»• T(iV'), for every instantiated type N, and 
V i-»- T(V), for every instantiated aspect V, 

define an strict functor between bicategories T: InstEng —* Set. 

Together, these define an inclusion ([/, T): InstEng Eng x Set. ♦ 

3 Ologs, instantiated ologs, and mappings between them 


We begin this section formalizing the definition of ologs found in [20], using a new construc¬ 
tion, which we will call a linguistic structure in Definition 3.1.1. We suggest that readers who 
are unfamiliar with ologs consult [16, §1, 2, 3, and 4] or [20, §1, 2, and 3]. 4 

4 Note that the later sections of [16], [20] also discuss pullbacks and pushouts within an olog; we do not address 
these notions in this paper. 
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In § 3.2, we define mappings between ologs, called linguistic functors (see Definition 3.2.1). 
These are mappings between categories that preserve the linguistic structure of the ologs. 
In Example 3.2.6, we explain this by providing two functors F and G, between the same 
two ologs, such that F is linguistic and G fails to be. In § 3.3 we define an instantiated 
olog to be a standard olog equipped with a set of examples that conforms to its linguistic 
structure. This formalizes the original notion of olog given in [20]. Finally, in § 3.4 we study 
mappings between instantiated ologs, called instantiated functors, which are instantiated 
analogues of linguistic functors. In particular, we provide a more linguistic version of the 
notion of meaningful functor given in [20]. All of the above is achieved by appealing to our 
definitions of Eng and Inst Eng from § 2. 


3.1 Linguistic structures 

Recall that an olog, as defined in [20], is a category whose objects and morphisms are 
labeled with noun phrases and verb phrases, respectively, in order to model a conceptual sit¬ 
uation. Commutative diagrams in an olog are called facts 5 ; they are equivalences between 
two sentences in the English language. Importantly, these types, aspects, and facts must 
follow certain guidelines, to ensure that the category-theoretic meaning is aligned with 
the conceptual and linguistic meaning intended by the authors. The Rules of Good Practice 
stated in [20, 2.1.2,2.2.3, and 2.3.4] are such a set of guidelines. An example of these rules is 
that the label of each arrow a -*■ b corresponds to a mathematical function [25]. All of these 
notions are author-dependent, as we made explicit using the notion of endorsement, as in 
Definition 2.2.1. After our work defining Eng in § 2, formalizing the notion of olog given in 
[20] is straightforward. 

Definition 3.1.1 Let C be a category. A linguistic structure on C is a lax functor L: C —> 
Eng. An olog is a pair (C, L), where C is a category, and L is a linguistic structure on C. ♦ 

The notion of lax functor, found in Definition 3.1.1, is standard in category theory literature; 
see for example [14, Definition 1.5.8]. However, we will now spell it out explicitly in our 
case. 

Remark 3.1.2 Let L: C — * Eng be a linguistic structure on C. Then: 

(a) Every c £ Ob(C) is mapped into a type L{c), consisting of a noun phrase C(c) and an 
endorsement Auth(L(c)) i= £(c). 

(b) Every / e Homc(c, c') is mapped into an aspect L(f), consisting of a functional verb 
phrase £(/): £(c) -»■ C{c') and an endorsement Auth(L(/)) 1= £(/). 

s Notice that in our proposed definition of facts, equivalences between sentences must be endorsed by an author 
set, and this is not explicitly mentioned in [20]. 
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(c) Let a^iicbea pair of composable morphisms in C, with h = f:<j. Then L being 
lax means that there 2-cell F: L(h ) => L(/); L(g), i.e., a fact, 

F\=[C(h)*C(fy,£(g)]. ♦ 

Example 3.1.3 We run through Definition 3.1.1 in the case of a category C, 



and a linguistic structure L on it, depicted here: 



(9) 


Each object has been assigned a noun phrase, and each arrow has been assigned a verb 
phrase. The two paths from 1 to 3, namely h and the composition of / followed by g, have 
been declared equivalent. 

We have not yet specified the author sets for the types, aspects, and facts above. This is 
arbitrary, but we specify a full example for completeness. Suppose we say 

Auth(l) := {A,B,C}, Auth(2) := {A}, Auth(3) := {A,B} 

Then we must have Auth(/) = 0 or Auth(/) = {A} because it must be a subset, Auth(/) £ 
Auth(l) n Auth(2); similarly for Auth(g). Let's say 

Auth(/) = Auth(g) := {A} and Auth(/i) = {B}. 

Since Auth(/;g) = Auth(/) n Auth(g) = {A}, but Auth(fi) = {B}, we must have F = 0. 
That is, given our choices, there is no one who is capable of endorsing the equivalence 
C(h) - £(/); C(g). However, it is still a fact because the diagram commutes in the ground 
category C. ♦ 


21 



















3.2 Mappings between ologs 


In this section we define the notion of morphism between ologs, which we call a linguistic 
functor, in Definition 3.2.1. After this, we can define the category of ologs, denoted Olog. 
We then give in Example 3.2.6 two ologs and two functors between them, one of which 
makes linguistic sense and the other does not, which recapitulates the issue we presented 
in our introductory example (1). 

Definition 3.2.1 Let (C,L) and (V. M) be ologs, as in Definition 3.1.1. A linguistic func¬ 
tor between them, denoted (F,F&): ( C,L ) —► ('D. M) consists of a functor F: C —> V 
together with a lax transformation F#: L M o F as shown below: 

C _ — _* V 

F* 

'Or 

Eng 

Figure 4: Linguistic functor. 

If F # is the identity, in which case L = M o F, we say that the linguistic structure L is the 
pullback of M along F and write L - F*(M). 

We define the category of ologs, denoted Olog, to be the category whose objects are ologs 
and whose morphisms are linguistic functors (see Remark 3.2.3). ♦ 

Remark 3.2.2 The morphism F# includes, for each c € Ob(C), an aspect Ff: L{c) 
M(Fc), called the c-component of F&, with endorsing author set Auth(F^). And for 
every / e Homc(c, F), it includes a fact 




L(c) 

L{f) 

L{C) 



M(Fc) 


F t * 


M(Ff) 



M(Fc') 


( 10 ) 


Both the arrows between objects in an olog, denoted and the component arrows for 
maps between ologs, denoted are assigned functional verb phrases. Although we use 
differently-shaped arrows to denote them, the equivalence in (10) is of the usual kind, as 
in Definition 2.1.4. 4 


22 





Remark 3.2.3 There is an evident functor Olog -*• Cat, sending an olog (C, L) to its under¬ 
lying category C. This is a Grothendieck fibration, meaning that if F: B -*■ C is any functor, 
there is a notion of cartesian morphism of ologs over F. This is just the pullback along 
F, in the sense of Definition 3.2.1. Grothendieck fibrations also come with a notion of ver¬ 
tical morphisms. These are the linguistic functors that restrict to the identity ide on the 
underlying category. ♦ 


Example 3.2.4 Consider the following linguistic structures L and M on the same category. 



L := 


M := 


2 


2 

a father 




a man 


l 

I 

1 


has 

I 


t 

includes 

I 

1 


a legitimate child 



a marriage 


includes 

I 


a mother 




a woman 

3 


3 


The phrase "legitimate child" is an old-fashioned term for a child who was born in a mar¬ 
riage, which itself is required to be between a man and a woman. Let S be an author set 
who endorses the linguistic expressions in L and M. Suppose these authors also endorse 
the following sentences as functional: 

"a legitimate child was born in a marriage" 

"a father is a man" 

"a mother is a woman". 

Technically speaking, they endorse the verb phrases ((oi )}:= "was born in"; ((02)):= "is"; 
and (( 0 : 3 )}:= "is" as components a c , for c e {1,2,3} = Ob(C), of a lax transformation a: L 
M. 

At this point, they have endorsed every object and arrow (both -* and in the diagram 
below: 


23 
































2 


2 



Figure 5: Example of a morphism a: L M between two linguistic structures on C. 

The 2-cells are drawn if each author s e S also endorses the corresponding equivalences. 
For example, a legitimate child has a father who is a man, and a legitimate child was born 
in a marriage, which includes a man. The point is, they had better be the same man(!) at 
least according to any authors endorsing a. ♦ 

Remark 3.2.5 Sometimes it is valuable to compare two linguistic structures L and M on 
the same category C, even if one cannot satisfy the somewhat strong conditions of a mor¬ 
phism of linguistic structures. Different ologs on the same underlying category have been 
considered before in other fields, such as biology and materials science [7]. In this case, one 
could consider a relaxed version of morphism a : L M in which the component aspects 
a c : L(c ) -■» M(c) were not required to be functional, just "conceptual", requiring a relaxed 
version of Eng. We will not explain how to do this category-theoretically, although it is 
possible. Instead, we just give an example to clarify the idea. 

Consider the left and right column, each a linguistic structure on the category | • -> • | . 

in a presidential system 
corresponds to 




in a presidential system 
corresponds to 


a presidential 
government 


has as head 


a president 


a parliamentary 
government 


has as head 

I 

a prime minister 


We cannot endorse the component maps between them (labeled, "in a presidential sys¬ 
tem corresponds to") as being functional. However, the diagram still shows a valid anal- 
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ogy between two forms of government. Analogies have been considered from a category- 
theoretic perspective before; see [4]. ♦ 

Example 3.2.6 In this example we will show two ologs (C, L) and ( V , M), and two func¬ 
tors F,G: C —* T> between their underlying categories. We will find that it is straightfor¬ 
ward to put a linguistic structure on F. On the other hand, we will explain why there is 
very little chance any author will endorse a linguistic structure on G. This way, we will 
explain an example from the introduction, § 1, namely the lack of linguistic structure on 
the functor in (1). 

Consider the following ologs: 

(C,L):= 

1 

a man 


is 


2 

an object 


(P, M):= 


a woman 


has as weight 
(in kilograms) 


a number 
between 
20 and 500 




an animal 


has as weight 
(in kilograms) 


a number 


Let F: C —*• V be the unique functor such that F( 1) = c and F( 2) = d, and let G : C —*• V 
be the unique functor such that G(l) = a and G(2) = b. Then the pulled back linguistic 
structures F*(M ) and G*(M) on C are shown on the left and right below. 


(C, F*(M)) := (C,L):= (C,G*(M)):= 


1 


1 

an animal 




a man 


has as weight 
(in kilograms) 


F* ? 


is 


a number 




an object 


G* ? 


1 

a woman 


has as weight 
(in kilograms) 


2 

a number 
between 
20 and 120 


( 11 ) 
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In order to extend F and G to linguistic functors, we need lax transformation F#: L 
M o F and G#: L M o G, as indicated in (11). We begin by endorsing a certain F#, and 
then return to explaining the difficulty of finding G&. 

To provide F#, we first need two component aspects; we use those labeling the top and 
bottom maps here: 


a man 


an animal 




has as weight 
(in kilograms) 


an object 


has as weight (in kilograms) 


a number 


( 12 ) 


It just suffices to ask whether, if we measure the weight (in kilograms) of a man, regarding 
him either as an object or as an animal, we get the same number. We endorse that fact, thus 
providing a linguistic functor (F, F&): (C , L ) —* (T>, M). 

On the other hand, it is difficult to do the same for G& as in the right side of (11). One 
can find functional verb phrases connecting [a man] to [a woman], for example "has as 
mother". However, it is not straightforward to find a functional verb phrase that connects 
[an object] to [a number between 20 and 120] making the necessary diagram (analogous to 
(12)) an endorsable equivalence between sentences. The reason, roughly, is that once one 
has regarded a man as an object, there is no aspect that applies to an arbitrary object which 
will correspond with the mother-having aspect of an arbitrary man. ♦ 


The upshot of the above example is that linguistic structures give the necessary semantics 
to constrain mappings between ologs. This has important applications to databases, as we 
will show in Example 3.4.6. 


3.3 Instantiated linguistic structures 

In this section we study instantiated ologs and mappings between them. These are ologs 
for which each type has been assigned a set of examples. The authors of an olog should, 
and generally do, know more than just some types and relationships; they should also 
have in mind some examples of these types and relationships. For example, someone who 
writes an olog about dogs, say including the arrow [a dog] —^ [a name], probably knows 
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some examples of dogs and their names. This information can be stored in an instantiation 
of the olog, which we will define in Definition 3.3.3. 


Instantiating an olog—filling it with conforming data—serves three purposes: 

i. It gives users a place to store data about—examples of—their subject of interest, 

ii. It validates the olog as a mathematical structure. 

iii. It differentiates between different author sets who endorse the same conceptual scheme 
(we mean a noun phrase, a sentence, an equivalence between sentences, or an olog). 

The first of these purposes is probably the most important, but it is also straightforward, so 
we briefly explain the other two. Issues of functional connectivity and endorsed facts (see 
Definition 2.1.3 and 2.1.4) rely on the authors' understanding of mathematical functions 
and their compositions. By instantiating an olog, the users validate that understanding. 


Another reason to instantiate an olog is to differentiate one group of authors from another, 
even if they use the same conceptual scheme. For example, consider the following linguis¬ 
tic structure: 


L = 


a person 


has 


-> 


a father 


(13) 


One author set may be interested in the fathers of US politicians ( e.g ., George W. Bush's 
father is George H. W. Bush), whereas another set may be interested in the fathers of fa¬ 
mous mathematicians (e.g., Emmy Noether's father is Max Noether). The same linguistic 
expressions in an olog can house multiple instantiations. 

The mathematical motivation behind instantiations comes from the concept of set-valued 
functors C —* Set as database instances, introduced by [6] and rediscovered by the second 
author in [17]. The same notion was defined for ologs in [20], where it was assumed that 
every olog comes equipped with such a functor C —*• Set. We find three problems with 
this: 

i. An olog can exist before one has recorded the corresponding examples. 

ii. The examples should have something to do with the linguistics expressions set in the 
olog. 

iii. Two authors may endorse the same linguistic expressions in an olog but have differ¬ 
ent examples. 

We have commented on iii above, and i is straightforward. We explain ii, which is probably 
the most important, in Remark 3.3.1. 
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Remark 3.3.1 In [20], there was no assurance that the functor I : C —* Set had anything 
to do with the linguistic expressions on C. So a type c = [a dog] would be mapped to a set, 
but there was nothing ensuring that it was a set of dogs. Of course, such a thing cannot be 
ensured mathematically, but in Definition 2.4.1 we did the next best thing, by providing a 
sentence for authors to endorse. ♦ 

Example 3.3.2 Consider the person-father olog (C, L) from Figure (13), reproduced here: 


a person 


has 


-> 


a father 


An author s might endorse that George W. Bush is a person, i.e., s l= (George W. Bush 
: a person), and similarly that George H. W. Bush is a father. Suppose, following (8) that s 
also agrees with the sentence: 

"George W. Bush is a person, which has a father, namely George H. W. Bush." 

Then s endorses that "has" is a correspondence between George W. Bush as a person and 
George H. W. Bush as a father. ♦ 

Let U' instEng —*• Eng and T: InstEng —> Set be the functors from Proposition 2.4.6, and 
recall from Proposition 2.4.6 that the induced functor InstEng ^ Eng x Set is an inclusion. 
The following definition is motivated to adapt the notion of instance functor given in [20] 
to our concept of linguistic structures. 

Definition 3.3.3 Let C be a category. An instantiated linguistic structure on C is a functor 
L\ C -* InstEng. An instantiated olog is a pair (C, L), where L is an instantiated linguistic 
structure on C. 

Given a linguistic structure L : C —*• Eng and a set-valued functor I : C —*• Set, we say that 
/ conforms to L if (L, I) factors through InstEng, 

(L,I) 

C -> Eng x Set 

InstEng 

Figure 6: Functors C —» Set conforming linguistic structures. 

The lax functor L: C —■> InstEng will be sometimes referred as an instantiation of L. We 
may refer to L and I (respectively) as the underlying linguistic structure and underlying 
instance of L. ♦ 
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It is this notion, that an instance should conform to the linguistic structure, which we find 
missing in [20]; see Remark 3.3.1. 

Remark 3.3.4 If L: C —*• InstEng is a lax functor, then we obtain an instantiated linguistic 
structure ( L,I ) on C by setting L U ° L and I := T' o L; by definition the instance I 
conforms to the linguistic structure L. We therefore do not distinguish between functors 
C —* InstEng and a pair (L. I) of functors L: C —* Eng and I : C —*• Set, with L lax, such 
that I conforms to L. ♦ 

Example 3.3.5 Recall again the person-father olog (C, L) from Figure 13, reproduced here: 


a person 


has 


-> 


a father 


We adopt a tabular description similar to that used in [17], except with column headings 
taken from (8). Using it, we can record the data of a functor I : C —*• Set as follows: 6 


This table then shows three correspondences, associated to the arrow labeled "has": 

• George W. Bush is a person, which has a father, namely George H. W. Bush; 

• Jeb Bush is a person, which has a father, namely George H. W. Bush; and 

• Emmy Noether is a person, which has a father, namely Max Noether. 

An author who endorses the six tokens and three correspondences shown here then also 
endorses I as an instantiation of the olog (C, L). ♦ 

Remark 3.3.6 The rules of instantiated English, as presented in Definition 2.4.4, can be un¬ 
derstood in terms of how people should interact with a database, as in [17]. The database 
corresponding to an olog would have tables named by noun phrases and columns labeled 
by verb phrases. A person should only enter a new row in table N if they understand (en¬ 
dorse) that noun phrase as a set, and thus can evaluate whether something was a member 
of it or not. A person should only fill a cell in column V if they understand how it is a 
function, and thus can evaluate how the row corresponds to a row in some foreign table. 

♦ 


a father 

George H. W. Bush 
Max Noether 
Bill Clinton 


a person 

has a father, namely 

George W. Bush 

George H. W. Bush 

Jeb Bush 

George H. W. Bush 

Emmy Noether 

Max Noether 


6 For a sentence (A/i,V,A/ 2), the instance will fit into a table with {(Ah)) as the head of the first column and 
((VAh)) as the head of the second column. 
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3.4 Instantiated functors 


We finish this section providing a notion of mappings between two instantiated ologs (C, L ) 
and (V. M) in terms of lax transformations, as in Definition 3.2.1, by constraining the set 
of linguistic functors ( C,L ) —*• ( T>,M ) to those that respect the tokens and their corre¬ 
spondences (See Definition 2.4.1). We will see in Remark 3.4.3 how this concept adapts the 
notion of meaningful functor introduced in [16, §4] to the context of linguistic functors. 

Definition 3.4.1 Let (C,L) and (fD, M) be instantiated ologs, as in Definition 3.3.3. An 
linguistic functor between them, denoted (F, F^): (C, L ) —*■ ( V , M) consists of a functor 
F ■ C — *■ V together with a lax transformation F^: L M o F as shown below: 



InstEng 


Figure 7: Instantiated functor. 

If F^ is the identity, in which case L = M o F, we say that the linguistic structure L is the 
pullback of M along F and write L = F*(M). 

We define the category of instantiated ologs, denoted instOlog, to be the category whose 
objects are instantiated ologs and whose morphisms are instantiated linguistic functors. 4 

Definition 3.4.2 Suppose that L,M: C —* InstEng are instantiated linguistic structures on 
C, where L - (L, I) and M = (M, J) are the underlying linguistic structures and conform¬ 
ing instances. If a\ L M is a morphism of linguistic structures, i.e., a lax transformation, 
and p: I => J is a morphism of instances, i.e., a natural transformation. 



Eng 


Set 


Figure 8: Morphisms a of linguistic structures, and morphisms p of instances. 
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then we say that p conforms to a if the lax transformation (a,p): C -* Eng x Set factors 
through InstEng ^ Eng x Set. 

Explicitly, for each object c s C, we have a verb phrase a c : /.(c) M(c) and a function 
Pc'- 1(c) -*■ J(c). We say that p conforms to a if for each c e C and token x e 1(c), each 
author s e Auth(a c ) endorses the correspondence a c (x,p c (x)). ♦ 

Remark 3.4.3 In [16], a mapping between ologs is called a meaningful functor; it is given by 
a functor F : C —> T> between two ologs C and V, and a natural transformation p: I => JoF, 
where I : C —*• Set and J: V —*• Set are instances of C and 'D. One particular issue with this 
notion is that there is no guarantee that the data defined by p conforms to the linguistics 
defined by F (nor that / and J conform to the linguistic structures on the olog). 

Our definition of instantiated functor remedies these issues. It is not hard to note from 
Definitions 3.4.1 and 3.4.2, along with Proposition 2.4.6, that every instantiated functor 
(C, L ) —s- (V 1 M ) is given by a functor F: C —> T>, a lax transformation F^: L M o F, 
and a natural transformation F i : I => J o F conforming to F\ where L = U ° L, M = U ° M, 
I = T o L and J = T o M as in Remark 3.3.4. This can be understood better with the help of 
Example 3.4.6. ♦ 


In some situations, it is more natural to consider morphisms between instantiated ologs by 
reversing the lax transformations and natural transformations involved. Doing this yields 
another category of instantiated ologs, in which all the results obtained so far also hold. We 
explain this better in the following remark. 

Remark 3.4.4 Suppose we are given an inclusion functor of categories *: C —> T>, i.e., C 
is a subcategory of V, and that both are underlying categories of instantiated linguistic 
structures. Then there should be more people who understand C, and therefore more data 
on C (than V). For example, consider the following inclusion of ologs: 


a person 


has as mother 

n 

a person 


So in this case, there would be a map of instances q : J o / => /, where I is an instance on C 
and J is an instance on V. There would also be a map of linguistic structures ft: M o i L, 
where L is a linguistic structure on C and M is a linguistic structure on V, to which q 
conforms. In fact, note that if J ° i =► / is the direction of instances, then it is only natural 
that the linguistics could go "the other way" 
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InstEng 


We define the category Inst0log + of co-instantiated ologs to have the same objects as InstOlog, 
but where morphisms are given as in (14)7 ♦ 

Remark 3.4.5 As we did with the category Olog in Remark 3.2.3, we note there is an evi¬ 
dent functor InstOlog -> Cat, sending every instantiated olog (C. L) to its underlying cate¬ 
gory C. This is a Grothendieck fibration whose cartesian arrows coincide with the strongly 
meaningful functors defined in [16, §4]. ♦ 

Example 3.4.6 Suppose we are given two categories C, V, which we think of as database 
schemas (as in [17]), a functor F : C —*■ V, and two instantiations I : C —> Set and J: V —* 
Set. Suppose that these two databases are to be merged. We are asked to find a morphism 
F ■ I => J o F. In this example, we show that our job will be easier if F has been equipped 
with a morphism F# of linguistic structures, so we can find F k conforming to F&. 

In order to emphasize the issue, we suppose that C and V are both single-object categories 

C = g]amiP = [H] 

equipped with the following linguistic structures: 


0 C,L ) 


1 


a 


a human 


and (V, A7) = 


a person 






Suppose we want to compare two instantiations, I : C —> Set and J: 'D —* Set, which are 
represented by the following tables: 


J(a) := 


7 The t symbol in our case has nothing to do with the dagger category [23]. 


7(1) := 


a human 


Emmy Noether 


George W. Bush 


a person 

Emmy Noether 
Max Noether 
Bill Clinton 
George H. W. Bush 
George W. Bush 
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To compare instances on different schemas, we first need a functor between them. In our 
case there is a unique functor F : C —*• V (it sends 1 *-*- a), so this is not an issue. With this 
functor in hand, we can pull back J to an instantiation ./ o F on C, and attempt to compare 
it to I. 


The purpose of this example is to show that the choice of linguistic structure F# on F is 
an important aid to making this comparison, i.e., to choosing a database homomorphism 
I => J o F out of the 5 2 = 25 possible choices. Consider the following two morphisms for F: 


a human 


a person 


a human 


has as father 


a person 


We denote the first by a and the second by /?, i.e., a. B ■ L M oF. Clearly, these morphisms 
give a useful hint at the intended semantics for the mapping. The only natural transforma¬ 
tion I => J o F we endorse as conforming to a (see Definition 3.4.2) is p, and the only one 
that conforms to /? is q, as shown below: 


a human 

is a person, namely 

Emmy Noether 

Emmy Noether 

George W. Bush 

George W. Bush 


a human 

has as father a person, namely 

Emmy Noether 

Max Noether 

George W. Bush 

George H. W. Bush 


This example complements, but also extends the scope of "meaningful functors" found in 
[16, §4]. ♦ 
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